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We explore the clustering properties of high redshift dark matter haloes, focusing 
on haloes massive enough to host early generations of stars or galaxies at redshift 10 
and greater. Haloes are extracted from an array of dark matter simulations able to 
resolve down to the "mini-halo" mass scale at redshifts as high as 30, thus encom- 
passing the expected full mass range of haloes capable of hosting luminous objects 
and sources of reionization. Halo clustering on large-scales agrees with the Sheth, Mo 
& Tormcn halo bias relation within all our simulations, greatly extending the regime 
where large-scale clustering is confirmed to be "universal" at the 10 — 20% level (which 
means, for example, that 3a haloes of cluster mass at z = have the same large-scale 
bias with respect to the mass distribution as Str haloes of galaxy mass a.t z = 10). 
However, on small-scales, the clustering of our massive haloes ( <L 10^h~^MQ) at these 
high redshifts is stronger than expected from comparisons with small-scale halo clus- 
tering extrapolated from lower redshifts. This implies "non- universality" in the scale- 
dependence of halo clustering, at least for the commonly used parameterizations of the 
scale-dependence of bias that we consider. We provide a fit for the scale-dependence of 
bias in our results. This study provides a basis for using extraordinarily high redshift 
galaxies (redshift ^^ 10) as a probe of cosmology and galaxy formation at its earliest 
stages. We show also that mass and halo kinematics are strongly affected by finite 
simulation volumes. This suggests the potential for adverse affects on gas dynamics in 
hydrodynamic simulations of limited volumes, such as is typical in simulations of the 
formation of the "first stars" , though further study is warranted. 

Key words: galaxies: haloes - galaxies: formation - methods: N-body simulations - 
cosmology: theory - cosmology: dark matter 



1 INTRODUCTION 

Dark matter haloes are formed from fluctuations in the mat- 
ter density field as characterized by the matter power spec- 
trum. A theoretical understanding of the relationship be- 
tween the matter power spectrum and the numbers and clus- 
tering of dark matter haloes provides an essential link be- 
tween cosmological parameters and the properties of haloes 
in which observable (or potentially observable) galaxies live. 
Haloes form from gravitationally induced non-linear collapse 
of mass overdensities that grow from primordial density fluc- 
tuations. Thus, halo clustering properties in a universe com- 
posed of cold dark matter are determined entirely by the 
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matter power spectrum, or equivalently, by the cosmological 
parameters. The aim of this paper is to use dark matter sim- 
ulations to quantify the relation between the matter power 
spectrum and resulting halo clustering properties, focusing 
on haloes of masses capable of star or galaxy formation dur- 
ing the era of reionization. It is beyond the scope of this 
paper to incorporate baryon physics to model galaxy for- 
mation within haloes. However, the distribution of haloes, 
which we explore, can be used as a basis for studies that 
populate haloes with galaxies to model galaxy clustering 
properties, which can then be used to probe cosmology and 
galaxy formation at high redshifts. 

Halo clustering and its evolution has been studied 
extensively by a number of authors, mainly focus- 
ing on cluster, group, or g alaxy mass scales a t low 
redshifts (e.g. IWhite et al.1 Il987l : IMo fc White! Il996l : 
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ISeliak fc Warren 



Wechsler et al. 



Wetzel et alTI l2007l: lAngulo. Baueh fc Lacevl 12008 
Basilakos. Plionis fc Ragone-Figueroa I 120081 ). However, 



few studies have explore d the halo or galaxy distribution in 



the reionizins era (e.g . Zahn et al. 20071: Coh n fc White 
l2008l : llUevet al. |[2008l : iRicotti. Gnedin fc ShuU 2008 1. an 
epoch just beginning to be targeted by a number of surveys, 
expected to provide a new probe of cosmology and the 
physics of galaxy formation. At these high redshifts, haloes 
large enough to host stars and galaxies are expected to form 
from much rarer fluctuations (i.e. collapsed from higher 
sigma overdensities) than haloes that host typical galaxies 
or even groups and clusters in the low redshift universe. 
This could lead to different halo clustering properties, 
which we will explore using numerical simulations. 

The Press fc Schechter formalism (jPress fc Schechter I 

11974 ) provides a general analytic framework for understand- 
ing halo formation, and has been improved by a num- 
ber o f subsequent authors (e.g. iBower lll99ll : lLacev fc Cole I 
I993I). Utilizing the extended Press fc Schechter formalism, 



Mo fc White I (|l996l ) developed a theoretical means of pre- 



dicting halo clustering properties from the linear matter 
power spectrum. Numerical simulations have been utilized 
by a number of the fore-mentioned authors to validate and 
improve upon these works. Recently, we used a suite of high 
resolution numerical simulations t o quantify t h e num bers 



of haloes during the reionizin g era Reed et al. (120071 ) (, 



also e.g. Lukic et al. 2007 and lHeitmann et al.1l2006l ). This 
work utilizes the same simulations to analyze the clustering 
properties of haloes. 

We argue that halo clustering is at least somewhat "uni- 
versal" with respect to the linear matter fluctuation field. If 
halo clustering is exactly "universal" , then haloes of differ- 
ent mass and redshift formed from linear fluctuations of the 
same rarity will have identical clustering properties (see § 2). 
For example, 3cr haloes of cluster mass at 2; = would have 
the same relative clustering with respect to the mass distri- 
bution as 3cr haloes of galaxy mass ai z — 10. Universality 
of halo clustering has been explored at lower redshifts by 
authors mentioned earlier, but has not been thoroughly ad- 
dressed during the reionization period. We argue later that 
approximate universality holds for large-scale halo clustering 
to redshift 30, though it breaks down at small scales for the 
more massive haloes at z ^ 10. The universality of large- 
scale clustering that we will show is a con venient feature 
(also held by the halo mass function; see e.g. Ijenkins et al. I 
I2OOII : Reed et al. 2003, 2007; Lukic et al. 2007) that should 
enable the results of our simulations, which are evolved only 
to z = 10, to be applicable to rare haloes over a wide redshift 
range. Rare, massive haloes attainable in current and future 
generations of high redshift clustering surveys, should have 
large-scale clustering strength relative to the mass, i.e. bias, 
similar to that of equally rare, massive haloes at lower red- 
shifts (e.g. clusters, QSO hosts) 

The mass range of high redshift haloes that we are 
able to resolve includes virtual ly all haloes l arge e nough to 
form stars or galaxies; see e.g. iReed et al. I (|2005|) and ref- 
erences therein for a review of high redshift star formation. 
The first stars are expected to form within "mini-haloes" of 



~ 10^ Mq where primordial gas cools and contracts mainly 
by ii'2-cooling (e.g. Abel, Bryan, fc Norman 2000,2002 ; 
Bromm, Coppi, fc Larson 1999,2002; lYoshida et 811120031 : 
lO'Shea fc Norman I [JOOTh . The "first galaxies" may form 
within haloes of virial temperatures greater than ~ lO'^K 
(~ 10^ M0 at 2 ~ 10), where gas is able to cool via 
collisionally-induced atomic hydrogen cooling, a more effec- 
tive process. One should keep in mind that metal enrichment 
or other feedback effects may have a large influence on what 
types of haloes can cool and transform gas into stars suffi- 
ciently to resemble a "galaxy" . 

Using extremely high redshift galaxies for cosmology 
has some particular advantages. Their small masses allow 
the matter power spectrum to be probed on very small 
scales. For example, warm dark matter models with filter- 
ing lengths ^ 100/i~^kpc can be tested if the abundance 
and bias of lO*/i~^M0 hosts of the "1st galaxies" can be 
measured. Additionally, "1st galaxies" can potentially en- 
able observations of the first stages of galaxy formation. 

Halo clustering can, in principle, be used to constrain 
both cosmology and the physics of galaxy formation with ex- 
tremely high redshift galaxies {z ^ 6) as foun d, for example, 
by th e Lyman-break (dropout) technique (e.g. lSteidel et al. I 
1 19991) or by the redshifte d Lyman alpha emission line 
(e.g. iTaniguchi et al. Il2005h . We note that our redshift 10 
haloes include the likely mass range of candidate 2 ~ 10 
Lyman-Q emitters found behind a lensing cluster by Stark 
et al. (2007). Because the survey volume of that study is 
highly uncertain, so is the abundance, and hence the mass, 
of dark halo hosts of these objects. As future ever more 
sensitive observations target similar galaxies, and with the 
upcoming launch of James Webb Space Telescope (JWST), 
their clustering properties should be measurable. This will 
constrain the properties of the haloes in which they live. It 
is thus essential to develop our theoretical understanding of 
halo and galaxy clustering in this regime. 

In this study, we examine halo bias in the reionizing 
universe. In § 2, we review briefiy theoretical studies of halo 
bias. In § 3, we discuss our suite of simulations and analysis 
techniques. We present measurements of halo bias and its 
scale-dependence from our simulations in § 4. In § 5, we 
analyze the kinematics of haloes and mass, and address the 
sensitivity of the pairwise velocity dispersion to the volume 
of the simulation. We conclude with a discussion of how our 
results can be used as a basis for using reionizing era galaxies 
to explore cosmology and galaxy formation. 



2 HALO CLUSTERING THEORY 

Clustering can be quantified by counting the numbers of 
pairs as a function of separation relative to that of a random 
distribution, as in the two-point correlation function. 
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Haloes can be either more or less clustered than the matter 
fiuctuation spectrum, as described by the bias, which we 
compute as 



b = \/ ihh{r) / (,mm{r) , 



(2) 



where (^hh is the halo two-point correlation function and ^mm 
is the same for the mass. The two-point correlation function 



clustering of the i^* galaxy halos 3 



is directly related to the power spectrum by the Fourier 
transform, expressed as: 



^(^) = 2^ 



„,, , sin(fcr) , 2 ,, 

Pik) — ;5^ — '-k^dk. 

kr 



(3) 



On large, linear scales, halo formation modeled by ex- 
tended Press fc Sch echter formalism (e.g. iBower I Il99ll : 
iLacev fc Cole I Il993l ) can be used to predict halo clus- 
tering. In this model, halo formation occurs when lin- 
ear fluctuations within a global Gaussian random den- 
sity field grow gravitationally to reach a critical thresh- 
old for collapse, Sc , computed for spherical overdensities. 
IMo fc White I (| 19961 ) utilized this model to predict halo bias 
and demonstrated that bias should be scale-independent 
on sufficiently large scales. In this formalism, the bias is 
clo sely related to t he mass function, which has been shown 
by ISheth fc Tormen (1999), Sheth, Mo fc Tormen (2001; 
SMT). and lSheth fc Tormen I (|20o3 ') to be better described 
by an ellipsoidal halo collapse model. The ellipsoidal collapse 
model of SMT results in large-scale bias cons istent with sim- 
ulatio ns of e.g. ISheth fc Tormen I l|l999l ) and lColberg et al. I 
l|2000l ). and given by: 

bsMT = l-\ 1=— ^{au^) + ^/abiau^)^'" (4) 

{au'^Y + b{l-c){l-c/2) 

where a = 0.707, b = 0.5, c = 0.6, u = S^/a{m), 5c = 1.686, 
and a{m) is the RMS linear overdensity in top-hat spheres 
of mass m. For an infinite volume, 



2f . D\z) 



k'^P{k)W^{k;m)Ak, 



(5) 



where Pik) is the linear power spectrum of the density fiuc- 
tuations at z = 0, W{k;m) is the Fourier transform of 
the real-space top-hat filter, and D(z) is the growth fac- 
tor of linear p erturbations normalised to unity at 2: = 
(|Peebles 1119931 ). The above bias relation is universal in the 
sense that bias depends only upon v — 5c/o{mhaio), which 
describes the "rarity" of the density fluctuation from which 
the halo is formed, independently of halo mass and redshift. 
There is an intrinsic scale-dependence of halo bias, which 
can be strong on small-scales, but vanishes on large-scales. 
The scale-dependence of bias is difficult to compute analyt- 
ically because the mass fluctuation fleld becomes nonlinear 
on small scales. It is thus advantageous to employ numerical 
simulations. 



3 TECHNIQUES AND ANALYSIS 

3.1 The simulations 

T his study consists o f a subset of the simulations presented 
in I Reed et al. I (|2007l ) , summarized in Table 13.21 These runs 
consist of dark matter particles evolved from linear initial 
conditions using a modified vers ion of the par allel N-body 
code GADGET2 (Springel 2005: ISpringel et al. 2005). The 
wide range of simulation volumes allows us to model haloes 
over a large range in mass, and to determine empirically the 
effects of finite simulation volume, discussed below. Conse- 
quently, our suite of simulations has a large enough effec- 



tive dynamic range to model haloes covering the bulk of the 
mass range able to form stars in the reionizin g era. Haloes 
are se lected by the friends-of-friends algorithm (i Pavis et al. I 
Il985i ) where all particles separated by less than 0.2 times 
the mean inter-particle separation are linked into a common 
halo. We have tested that our results are not sensitive to the 
choice of halo finder by verifying that halos defined using a 
spherical overdensity of 178 times mean, another commonly 
used halo definition, have clustering properties similar to our 
friends-of-friends halos (see § 4.1 for further discussion). 

3.2 Finite volumes and their effects on spatial bias 

The effects of the finite simulation volume might be expected 
to alter the clustering properties of the simulation. Finite 
simulation volumes lack power with wavelengths larger than 
the box, and suffer from discrete ness of power especially 



for long wavelength modes (e.g. Barkana fc Loeb 



Bagla fc Rav I l2005l: [Power fc Knebe I I2OO6I : ISirko 



Bagla fc Prasad 112009 : iReed et al. II2007I : Lukic et al. 2007) 



2004 



2005 



The resulting effects on clustering can be strong for the small 
volumes required to model the low mass haloes relevant for 
high redshift star formation. But fortunately, finite-volume 
effects are similar for dark matter and for haloes, as we 
show in Fig.[T] As a result, the finite- volume effects on halo 
bias are much weaker than the effects on absolute clustering 
properties of mass or haloes. For this reason, the halo bias 
measured within a simulation is reliable over a substantial 
range of scales smaller than the length of the box. 

Fig. [T] shows an example of the two-point correlation 
function, (,{r), for dark matter and for haloes with virial 
temperatures greater than 10* A' at 2 = 10 (approximately 
~ IQ^H-^Mq, using the relation Mio4 = 10** x [10/(1 4- 
z)^"]/i~^M0), and spanning a factor of ten in halo mass. 
The bottom panel shows halo bias for several box sizes, 
demonstrating that any systematic affects on bias due to 
finite-volume are weak provided that the scale sampled is 
sufficiently smaller than the box size. We find that as a con- 
servative choice, the bias measured from pair separations 
below O.lLbox is adequate for our desired level of accuracy 
( ^ 10%) in our simulations. The bold portion of the bias 
curves in Fig. [T] and [5] extend only to this scale. Although 
the scatter in bias is close to ~ 50% at small scales and 
large masses where numbers of pairs are small, we demon- 
strate later that we are able to measure bias at large scales 
to ^ 10%, using O.lLbox as a maximum pair separation. Our 
smallest box size of l/i~^Mpc, which might be expected to 
be most likely to suffer finite-volume effects, does not have 
enough haloes of this mass for us to compute reliably a cor- 
relation function; however, we have confirmed using smaller 
haloes that the bias found in the smallest boxes agrees with 
that in larger boxes. 

The decrease in bias at small scales is mainly due to the 
fact that it is not possible to have a halo pair separated by 
less than twice the hal o radius, sometimes referred to as the 
halo exclusion effect IjBenson et al. II2OOCI ). On very small 
scales, the uncertainty increases due to small numbers of 
closely separated pairs, as reffected by the increased scatter. 

We have just shown that halo bieis is not affected 
strongly by box size, an argument that is strengthened by 
the agreement among overlapping boxes in Fig [2] However, 
because individual realizations and different simulation vol- 
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^ runs 


Lbox 


mpart 


Npart 


fsoft 




h.-iMpc 


ft-lMg 




h-ikpc 


11 


1.0 


1.1 X 10^ 


400=* 


0.125 


1 


2.5 


1.1 X 10=* 


1000^ 


0.125 


1 


4.64 


1.1 X 10^ 


400=* 


0.58 


2 


11.6 


1.1 X 10^ 


1000^ 


0.58 


1 


20 


8.7 X 10^ 


400^ 


2.5 


2 


50 


8.7 X 10^ 


10003 


2.4 



Table 1. Summary of the simulations presented in this work. 
Nruna random realisations of cubical volumes of side Lbox were 
simulated. Npart particles of mass mpart and gravitational force 
softening length r^^it were employed. 



urties each have a unique form of cr(m), consideration of the 
universality of halo bias can be strengthened if one uses the 
relation between a and M specific to each realization as the 
effective universal mass variable. This mitigates the effects 
of missing large-scale power and reduces run-to-run scat- 
ter. For a periodic cosmological simulation, the smoothed 
rms linear overdensity, a, is given by the discrete analog of 
Eqn.El 



\m) = D\z)Y,\5^?W\k- 



(6) 



where | (5k | refers to the linear amplitude of the Fourier modes 
at « = 0, and D and W are the same as in Eqn. [5] This 
approach has been suc cessful in the para meterization of the 
halo mass function (see lReed et al. 1120071 for detailed discus- 
sion) . We discuss further in the Appendix the importance of 
considering finite volume effects for estimates of large-scale 
clustering. 



4 RESULTS: HALO BIAS 

4.1 Scale-dependence 

We present halo bias as a function of scale for haloes of a 
range of masses at redshift 10 in Fig. O At large scales, the 
halo bias approaches the number-weighted large-scale bias 
predictions of SMT, given by 



bsMT{> rnmin) — n{> mminY 



bsMT{m) -— dm, (7) 
am 



where bsMT{m) is given in eqn. 4, and we tak e dn/dm to be 
the mass function given bv lReed et al. I ((20071), Eqn. 11. We 
explore further the large-scale bias in the next subsection. 
First, we show the scale-dependence and compare with fits 
from clusters in simulations of much lower redshifts. The 
scale-dependence has been parameterized in terms of the 
halo pair separation, r: 

b{m,r,z) = bsTim,z)[l + bsrim, z)a{r, z)]" , (8) 

wit h a = 0.15 b y 'Hamana et al. I (120011 ) and a = 0.35 
bv iDiaferio et al. I ( 2 003), where bsr is the bias relation 
of ISheth fc Tormen I (|l999f ). which is similar to bs MT- The 
steeper scale-dependence of IDiaferio et al. I (|2003t ) is con- 
sistent with our small haloes, but is not steep enough to 
match the most massive haloes in our simulations. The 



iHamana et al. I (|2001h bias relation (not pl otted) is much 
shallower than that of IDiaferio et al. I (|2003r ), making for a 
poor ma tch to the scale-dependence in our data. The bias 
fit of Tinker et al. I ( 200 5), which is parameterized in terms 
of the non-linear mass correlation function, ^(r), rather 
than the linear variance in spheres, as in the above rela- 
tion, implies e v en we aker scale dependence than that of 
iHamana et al. I (|200ll ) when extrapolated to our mass and 
redshift range, and so is not consistent with our results. 

It requires a large extrapolation to apply low redshift 
fits of cluster bias to our higher redshifts, lower masses, 
and much rarer (larger v) haloes than previously tested, 
so it is perhaps not surprising that we find a steeper scale- 
dependence for many of our haloes. Even so, the reasonable 
level of agreement for our lower mass haloes suggests "par- 
tial universality". Our haloes at redshift 10 are well fit by 
the following relation: 

b{m,r,z) = bsMT{m,z)[l + 0.03bsMT{m, z)^a{r, z)'^] (9) 

(ignoring halo exclusion effects at small separations). We 
have checked that our fit remains consistent with our sim- 
ulation outputs from redshifts 10 to 30, though the uncer- 
tainties and scatter become greater at high redshifts. This 
fit, however, is not intended to be valid at redshifts lower 
than we have consid ered. In fact, if extrapo lated to the lower 
redshifts probed bv IDiaferio et al. I (|2003l ). our fit has sig- 
nificantly steeper scale-dependence than their simulations, 
refiecting non-universality. We have confirmed the lack of 
universality of this parameterization of the scale-dependent 
bias using separate simulations of larger volumes evolved to 
low redshifts. Though the halo samples considered here were 
divided into cumulative samples for illustrative purposes, 
the above fit agrees with the bias obtained when samples 
are restricted to a much narrower mass range of a factor 
of two. For visual clarity, we have chosen to plot in Fig. [5] 
the value of bsMT for an infinite volume rather than the 
individual finite- volume corrected value for each simulation. 
We demonstrate in the Appendix that the finite-volume cor- 
rected large-scale bias of each realization is a much better 
description of the bias measured in simulations, particularly 
for the smallest boxes. 

A number of factors may affect the universality of the 
scale-dependence of halo bias. The slope of the linear power 
spectrum on halo pre-coUapse scales is much steeper in our 
simulations because the halo masses are smaller relative to 
that typical of low redshift simulations. The steeper power 
spectrum may affect halo clustering (see e.g. Jing 1998|) and 
cause the apparent non-universality with respect to low red- 
shift clusters. Another contributing factor may be that the 
effective definition of a halo changes based on its concentra- 
tion and environment (Lukic et al. 2008). The low concentra- 
tions and dominance of large filamentary structure at high 
redshift (Gao et al. 2005) may cause the halo finder to com- 
bine or split haloes differently relative to their Lagrangian 
(linear) initial density fluctuations. We have checked the ef- 
fect of halo definition by verifying that the bias of haloes de- 
fined using a spherical overdensity of 178 times mean is con- 
sistent, though perhaps slightly higher an d steeper, as migh t 
be expected from their lower abundance (JReed et al. II2007I ) . 
Further theoretical work regarding halo formation is needed 
to understand fully the scale-dependence of halo clustering. 
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Figure 1. Top panel: Correlation function for haloes (gh/i, solid) and partieles {£,mm, dashed) for all haloes of mass ~ lO*~^/i~^M0 
extracted from a range of simulation volumes, at 2; = 10. Here, and in subsequent figures, box size and particle mass labels have units 
of h~^Mpc and /i~^Mq, respectively. Error bars on ^/jj, (solid) are 1 — a Poisson from halo pair counts per bin. Bottom panel: Bias, 
(€/i?i/Cm"i)^ , for a range of simulation volumes. Bold portions denote the most reliable range of scales for the computed bias, extending 
to approximately O.lh]^^^; the agreement of different box sizes demonstrates the weakness of finite volume effects on halo bias. 



We leave the search for a more universal parameterization 
of the scale-dependence of bias to future studies. 



4.2 Large-scale bias 

We compute the large-scale bias for our halo sample in Fig. 
[3l The SMT bias relation is consistent with our data at all 
redshifts while the Mo & White bias relation is inconsistent 



with our results. By going to very high redshift, we are able 
to measure the bias of haloes formed from ~ Aa fluctuations, 
representing a major improvement upon previous bias mea- 
surements at hi gh redshift. In particular, the M illennium run 
bias data (from lGao. Springel. fc White II2005I ) extend to 3(t 
haloes at z = 5, and lCohn fc White 1 1|2008|) compute bias for 
a narr ower mass range at z = 10. lAngulo. Baugh. fc Lacev I 
(|2008t ) measure bias for ~ 4.5cr halos, but at redshifts of 3 
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Figure 2. Bias, (^/i/i/i^mm)^ " at 2: = 10 for a range of halo masses. The bold portions of the curves denote the most rehable range 
of scales for each box (see text), and long-dashed (red) is our fit (neglecting halo exclusion at small scales). For larger haloes, our bias 
scale-dependence is steeper than extrapolations of low redshift fits from Diaferio et al. (2003) (short-dashed, black) and others (see text). 
Our large-scale bias agrees with the prediction by Shcth, Mo, & Tormen (2001), denoted by the horizontal dotted (black) lines. 



and below. While we have not explicitly plotted uncertain- 
ties, we note that the scatter among different simulations 
can be used to estimate visually an ensemble bootstrap er- 
ror. This scatter suggests that the data are accurate to about 
10% in bias (ignoring any potential systematics) . Given the 
scatter within our data, it is not possible to distinguish be- 
tween t he SMT bias relat i on an d that of more p recise fits 
such as ISeliak fc Warren I l|2004l ). [Tinker et alTI |200i), or 



the bias relation that would be derived using the mass func- 
tion fit to these simulations (Reed_et aL 2007). 

The agreement with the plotted Millennium data 
(jSpringel et al. I |2005|) for our 2 — 3cr haloes confirms 
universality and suggests that our box-size correction 
technique is successful. The evidence for universality 
of halo bias is strengt h ened by our agreement with 
lAngulo. Baugh. fc Lacev I l)2008l ) for 2 — ia haloes, where 
their results agree with the SMT bias. However, the source 
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of the ~ 20% disagreement between our data and the lower 
red shift bias at i^ ~ 1.5 from th e MiUe nnium run, with which 
the lAngulo. Baueh. fc Lacev I ()2008l ) bias appears to also 
agree, is not clear. Although this could indicate departure 
from universality at the ~20% level (in bias), because the 
differences occur at small masses in the smallest boxes where 
the finite- volume corrections are largest (see Appendix), po- 
tential systematics are not easily ruled out. Our work veri- 
fies that estimates of halo bias derived from extended Press- 
Schechter theory and variants are valid for extremely rare 
haloes. Additionally, our comparison with other studies at 
lower redshifts significantly extends the confirmed regime of 
approximate universality (at the 10 — 20% level) in mass, 
redshift, and v of large-scale bias. 

Some care must be taken when computing bias from 
haloes in finite volumes. Because the scale-dependence of 
the bias extends to scales approaching the maximum pair 
separations where bias measurements remain robust, a sim- 
ple averaging of the bias over the reliably sampled range in 
scales would result in a systematic error (i.e. a "bias bias"). 
We thus utilize the scale dependence of bias presented ear- 
lier to fit more reliably the large scale bias. The large-scale 
bias that is plotted in Fig.[3]is then computed by performing 
a least squares fit using Eqn.[5]and replacing bsMT with the 
bias to be fit, as follows: 



10 



b{m, 



■bfit{m,z)[l + G.O'ibfit{m,z) a{r,z) ]. 



(10) 



To avoid peculiarities of bias for closely separated pairs 
where numbers are small and bias is steepest, bias is then 
fit over pairs separated by a range of 0.033 — O.lLbox, which 
also remains within the range shown previously to be most 
robust against finite box effec ts. 

In lCohn fc White ll|2008h . the large-scale bias was taken 
to be the bias at 1.5/i~^Mpc. They caution that they may 
not have captured the bias at a scale large enough to refiect 
the asymptotic large-scale value. Indeed, their large-scale 
bias estimates are higher than ours over the range of overlap, 
covering 3-3. 7o" haloes, also ai z = 10. Their bias estimates 
are ~ 10-20% higher than the SMT bias relation, whereas 
our results agree with the SMT bias. These differences are 
consistent with the overestimate in bias expected when es- 
timating the large-scale bias from the value at 1.5/i~^Mpc 
(see Fig. [21). 



5 RESULTS: HALO PAIRWISE VELOCITY 
DISPERSION 

The kinematics of high redshift galaxies represent an ad- 
ditional probe of cosmology and galaxy formation, poten- 
tiall y more sensitive than the s patial correlation function 
fe.g. lZhao. Jing. fc Borner 1 12002'). The pairwise velocity dis- 
persion, (T«i2(r), has been utilized extensively in l ow redshift 
cosmological surveys (e.g. lDavis fc Peebles Ill983l ). (J„i2(r) is 
defined as the one dimensional rms velocity of pair members 
in the direction of the line-of-sight connecting the pair, sepa- 
rated by distance r. From Fig.|3]it can be seen that the pair- 
wise velocity dispersion of both mass and haloes tends to be 
suppressed by finite simulation volume. Particles and haloes 
of identical mass in different size volumes have significantly 
different velocities; note that the pairwise velocity dispersion 
has an inherent mass-dependence, so objects of similar mass 
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Figure 3. Large-scale bias, {£,hh / ^mmY ' '^ , shown as a function 
oi V = <y{'miiaio)/^c compared with the theoretical predictions 
of Mo & White (1996) and Sheth, Mo & Tormen (2001). Bias 
is computed in bins of factors of 2 in halo mass, using a least- 
squares fit that assumes the scale-dependence described by Eqn. 
[9] Millennium run data are taken from Gao et al. (2005). 



must be considered. In the case of haloes, some kinematic 
effects could be due to the difference in average halo mass 
of the sample d ue to the fini te volume suppression of high 
mass haloes fsee lReed et al. I I20Q7). Pairwise dispersions for 
the ~ W^K halo sample, (covering ~ 10*~^/i~^Mq) at sep- 
arations of Q.2/i~^Mpc (comoving) range from ~ 75km s~^ 
in the 12/i~^Mpc volume to ~ 50km s~^ in the 5/i~^Mpc 
volume, dropping to ~ 30km s~^ in the 2.5/i~^Mpc volume. 
The finite- volume effect on pairwise velocities decreases with 
close pair separation, presumably because the infall velocity 
of close pairs is dominated by the total mass of the pair. 

Particle and halo kinematics are affected at different 
levels by finite volumes, altering the pairwise velocity disper- 
sion bias, 6„i2, defined here as the ratio of cr„i2(r) for haloes 
compared to particles {(7vi2,hh/o'vi2,mm)- Velocity bias is 
shown in Fig [5] for the same haloes shown in the cluster- 
ing bias plot (Fig[5|. Run-to-run scatter in velocity bias be- 
comes increasingly large on small scales. This is in contrast 
to the halo spatial bias, which is largely free of finite volume 
effects on scales below ~ O.lLbox, despite similarly strong 
effects individually on particle and halo clustering strength. 
We leave further development of techniques for correcting 
for effects of finite simulation volume on kinematic effects 
to future works. 



6 DISCUSSION AND CONCLUSIONS 

We have presented results for the clustering of haloes span- 
ning the mass range likely to host stars and galaxies at red- 
shift ten. Ultimately, the clustering properties of high red- 
shift galaxies may be used as a probe of cosmology and also 
as a probe of the physics of high redshift galaxy formation. 
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Figure 4. Top panel: Pairwise velocity dispersion of halo pairs 
('''t;l2,hh(r), solid) and particle pairs (o"t)i2,mm(r), dashed) for all 
haloes with mass ~ 10*~^/i~^Mq extracted from a range of sim- 
ulation volumes, at z = 10 (the same sample as Fig. [TJ. The 
finite- volume effect on pairwise velocities decreases with close pair 
separation, presumably because the infall velocity of close pairs is 
dominated by the total mass of the pair. Bottom panel: Pairwise 
velocity bias, (T„i2 hh/^vi2,mm for a range of simulation volumes. 
Bold portions denote the most reliable range of scales previously 
estimated for spatial bias (extending to approximately O.lLtox)- 
The relative disagreement in pairwise velocities and bias for simu- 
lations of different volume indicate significant finite volume effects 
in the kinematics of mass and haloes. 



Though we focus our discussion on the reionizing era, clus- 
tering properties are potentially a valuable tool at any red- 
shift, particularly for rare haloes where the bias is high with 
strong scale-dependence, and the sensitivity to cosmological 
parameters and halo mass is strong 

As an example, it is worth considering the simplest case 
where there is a one-to-one relation between halo mass and 
galaxy luminosity, i.e. if Lhaioim, z) has no scatter, and ex- 
actly one galaxy occupies each halo. Here, a measurement 
of the luminosity function would allow one to halo derive 
the halo mass for an assumed cosmology directly from the 
mass function. The inferred mass then determines the halo 
bias which, in turn, can be used to constrain cosmological 
parameters, as illustrated in Fig. [6] Note that determin- 
ing the galaxy bias from observations requires a simulta- 
neous measurement of both matter clustering and galaxy 
clustering, a difficulty which we later discuss. Since all po- 
tentially observable objects at extremely high redshifts live 
in haloes much larger than M«, the cosmology dependence 
of bias remains large for all observationally attainable halo 
abundances. In contrast, at low redshifts, the cosmology de- 
pendence of bias grows weak for small abundances. For the 
most abundant haloes, a bias measurement of 20% accuracy 
could distinguish between a low as of 0.76 and a high erg 
of 0.9 if measured from haloes of abundance ~ l/i^Mpc""^ 
(~ lO*~^/i~^M0). Although neither of these normalizations 
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Figure 5. Pairwise velocity bias, o"i,i2,hh/o'iii2,mmi at z = 10 
for the same halo samples as in Fig. [2] The bold portions of the 
curves denote the range of pair separation where spatial bias was 
found to be relatively robust. In contrast to spatial bias, however, 
kinematic bias is affected significantly by finite simulation volume 
because particle and halo velocities are suppressed at different 
levels. 



are favored by current CMB data, they are in fact within 
■ -^ (2 — 3)cr of the re cent WMAP 5 year normalization 
iJKomatsu et al. II2008I ). Moreover, the point remains that 
bias is generally more sensitive to cosmology at high red- 
shifts, and at a wide range of halo abundances and mass. 

The bias-abundance relation may remain largely un- 
altere d even if IGM absorption is significant. Illiev et al. I 
(2008), for the case of high redshift Ly-a sources, showed 
that dimming due to IGM absorption does not affect signif- 
icantly the relation between observed number density and 
source clustering. Despite the likelihood of i ncreasing galaxy 
clust ering at fixed apparent luminosity ([McQuinn et al. I 
120071 ) by altering the apparent luminosity-mass relation, 
clustering at fixed number density is not affected strongly 
by IGM absorption. This means that the high optical depth 
of the high redshift IGM should not diminish the potential 
of using reionizing era galaxies for cosmology. 

The discussion so far is intended to highlight only the 
potential for use of high redshift galaxy clustering mea- 
surements to constrain cosmology. In practice, to deter- 
mine halo bias, one must measure not only the strength 
of galaxy clustering, but also the strength of matter clus- 
tering. There is the possibility of measuring matter clus- 
tering at these redshifts using 21cm tomography, although 
it will be a challenge because of complex astrophysical ef- 
fects inc luding sources of Lym a n-g and ionizing photons 
(see e.g. iBarkana fc Loebl l2005l : iFurlanetto. Oh. fc Briggsl 
120061 : IShaw fc Lewis II2008I ). For this reason, galaxy cluster- 
ing measurements may have more value as an indicator of 
the mass of the halos in which the galaxies reside, once one 
has determined the cosmological parameters by other means. 
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Figure 6. Bias, {£,hh/£,mmY''^ at 2 = 10 as a function of cumu- 
lative halo number density for 2 cosmologies with different values 
of (7g and Us- The bias relation used is the Sheth, Mo & Tormen 
(2001) fit, applicable to large scales (see Fig.[2ll- Halo number den- 
sity is a fit to simulations from Reed et al. (2007), Eqn. 11. Halo 
bias at 2 = 10 is generally more sensitive to cosmology than at 
low redshifts, and remains so over a wide range in halo abundance 
(or halo mass). 



of re ionization bubbles (e.g. Illiev et al. I I2OO6I : IZahn et al. I 
I2OO71) may correlate with g alaxy clustering on large scales 
(e.g. IWvithe fc Loeb 1 12007| ). which leads to the possibil- 
ity of reionization affecting gal axy formation and galaxy 
clustering on large scales (e.g. iGnedin I I2OO0I ). Or, mul- 
tiple galaxies may occupy massive haloes, which creates 
a degeneracy in galaxy abundance between satellite num- 
bers and halo mass. Similarly, there exists a degeneracy 
between halo mass and satellite numbers in the scale- 
dependence of clustering on small scales. These degeneracies 
are difficult to overcome observationally because of the need 
for clust ering measurements on very smaU ang ular scales 
(see e.g. iBuUock. Wechsler. fc Somerville I l2002h . However, 
one can use theoretical arguments, semi-analytic modelling 
and/or smaller volume simulations utilizing hydrodynamics 
and other detailed physics to infer basic galaxy properties 
such as the numbers, luminosities , and environmental in- 
flueri c es (e.g.lMcQuinn et al. II2007I: JMesinger fc Furlanetto I 



I2OO7I : Illiev et al. I I2OO6I : iRicotti. Gnedin fc ShuU I l20o'i r 
Progress may then be made by combining the results of 
such detailed models of galaxy formation with determina- 
tions of halo clustering from dark matter simulations in or- 
der to produce detailed predictions for observable clustering 
properties. This will help to break the degeneracies between 
galaxy "feedback" and halo clustering on small scales, as well 
as determine whether haloes tend to host multiple luminous 
galaxies. As ever deeper observations are made, such studies 
will allow us to probe galaxy formation at its earliest stages 
and to explore cosmology at the high redshift frontier. 



The steep mass-dependence of both the halo number den- 
sity and the halo bias at these high redshifts increases their 
power as a means of linking galaxy properties to halo mass. 
If there is scatter in the Lhaioi'm) relation, the re- 
latio n between abundance and m ass no longer holds 
(e.g. IWhite. Martini, fc Cohn I 120071 '). However, provided 
that on large-scales the scatter is random, there still re- 
mains a direct one to one relation between large-scale bias 
and halo mass for a given cosmology via the "universal" 
quantity f. This suggests the potential to constrain the 
mass of the dark halo hosts of reionizing galaxies by mea- 
suring the clustering strength. At slightly lower redshifts of 
2; ~ 4 — 5, clustering measurements have been made of Ly- 
a-emitters and Lyman-break galaxies, allowing estimates of 
the mas s of their host haloes (see e.g. Haman a et al. 20041: 
Kashika wa et al. I ,2 006: Hamana ct al. 200 61: iLee et al. 
20061 : iGonrov. Wechsler. fc Kravtsov I l2006l : iKovac et al." 
2003). 



Once the large-scale bias and cosmology are determined, 
the scale-dependence of halo bias, to the degree that it is 
"universal" , is also determined. Gomparison of the observed 
scale-dependence of galaxy clustering to that of haloes with 
the same large-scale bias thus provides a valuable indica- 
tor of local environmental effects on galaxy formation. The 
strong clustering and steep scale-dependence of the cluster- 
ing for high redshift haloes may cause feedback from neigh- 
bouring galaxies to affect star formation during these early 
times. 

Many possible complications to this simple picture 
will need to be considered. For example, the large scales 



We close with a brief discussion of the (perhaps sur- 
prisingly) large suppression to the halo pairwise velocity 
dispersion by finite simulation volumes. Pairwise velocity 
dispersions are reduced by up to and beyond 50% in our 
simulations, even on scales where halo spatial bias is well 
behaved. The general suppression of halo and mass veloc- 
ities may have significant effects on hydrodynamic simu- 
lations where small volumes are often required because of 
computational expense. For example, volumes of ;$, IMpc 
are commonly employed in simulations that model the cool- 
ing of gas within high redshift haloes to form the "first stars" 
(e.g. Abel, Bryan, fc Norman 2000,200 2 ; Bromm, Goppi, fc 
Larso n 1999,2002; lYoshida et aTlbOOgl : lO'Shea fc Norman I 
[2007]). This suggests the possibility that gas heating due to 
shocks of infalling material of merging haloes could be inhib- 
ited due to the suppression of pairwise velocities. Such effects 
have the potential to alter gas cooling and the ionization 
level of halo gas, which could affect star formation. How- 
ever, because the velocity suppression diminishes for close 
pairs, it is not clear whether the kinematic suppression is 
actually a major problem for gas dynamics or star forma- 
tion within simulations of the high redshift universe. Any 
such problems could be minimized by the "zoom" resimu- 
lation technique, consisting of a high resolution subvolume 
embedded within a larger and lower resolution cosmological 
volume, as in e.g. iGao et al. I (|2007l ). Further investigation 
of finite- volume effects on kinematics and any associated ef- 
fects on gas properties is warranted. 
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r APPENDIX 

Here we demonstrate the importance of taking into account 
the finite simulation volume when considering large-scale 
halo bias. By computing a^{m) from Eqn.|6] instead of Eqn. 
[S] the mean power at each discrete wavenumber present in 
the simulation is taken into account. This has the effect of 
shifting (T{mhaio), usually toward smaller values (larger val- 
ues of v). Fig. [T] which uses Eqn. [5] to compute crirahaio), 
thereby ignoring the finite volumes, shows that two large 
effects are present if finite simulation volume is not treated 
appropriately. First, there is significant run-to-run scatter 
in bias, approaching ~ 50% for the smallest volume, to be 
compared with the ~ 10% scatter in Fig. [3] where Eqn. [6] 
is used to compute (j(mhaio)- Second, the inferred simula- 
tion bias is significantly higher when finite volume is ig- 
nored, especially for the smallest boxes, and the larger haloes 
within them. This approach of finite- volume compensation 
does not correct for discreteness of phases, nor does it pro- 
vide a correction for the deviations as a result of non-linear 
coupling to only a sma ll number of modes at large scales 
l|Takahashi et al. II2008I ). However, the improvement in run- 
to-run scatter, the better fit to the SMT bias relation, and 
the better agreement with the Millennium run results, all 
provide some level of assurance that this approach of correct- 
ing for finite volume is justified. Similar reductions in run-to- 
run scatter and improved universality of the halo mass func- 
tion were demonstrated in I Reed et al. I (120071 ). strengthening 
the importance and validity of correcting for finite simula- 
tion volume in cosmological simulations. These corrections 
allow one to simulate smaller haloes, and hence achieve a 
larger effective dynamic range, while maintaining their value 
as cosmological tools. 
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Figure 7. This figure uses the sigma(irL) computed for an infinite 
volume (Eqn.O instead of that computed taking into account the 
power present in each realization (Eqn.[6ll. Otherwise this figure is 
identical to Fig. |3] (including the Millennium run results from Gao 
et al. 2005). The larger run-to-run scatter in bias and the poorer 
fit to the SMT relation demonstrate the usefulness of correcting 
for reduced power and discrete modes in finite volumes. 



